Hypergraph-Clustering Method Based on an Improved Apriori Algorithm
نویسندگان
چکیده
With the complexity and variability of data structures dimensions, traditional clustering algorithms face various challenges. The integration network science has become a popular field exploration. One main challenges is how to handle large-scale complex high-dimensional effectively. Hypergraphs can accurately represent multidimensional heterogeneous data, making them important for improving performance. In this paper, we propose hypergraph-clustering method dubbed “high-dimensional method” based on hypergraph partitioning using an improved Apriori algorithm (HDHPA). First, constructs association rule algorithm, where frequent itemsets existing in are treated as hyperedges. Then, different mined parallel obtain hyperedges with corresponding ranks, avoiding generation redundant rules mining efficiency. Next, use dense subgraph partition (DSP) divide into multiple subclusters. Finally, merge subclusters through sub-hypergraphs results. advantage lies its model discretize between space, which further enhances effectiveness accuracy clustering. We comprehensively compare proposed HDHPA several advanced methods seven types datasets then their running times. results show that evaluation index values generally superior all other methods. maximum ARI value reach 0.834, increase 42%, average time lower than All all, exhibits excellent comparable performance real networks. research paper provide effective solution processing analyzing also conducive broadening application range techniques.
منابع مشابه
An improved opposition-based Crow Search Algorithm for Data Clustering
Data clustering is an ideal way of working with a huge amount of data and looking for a structure in the dataset. In other words, clustering is the classification of the same data; the similarity among the data in a cluster is maximum and the similarity among the data in the different clusters is minimal. The innovation of this paper is a clustering method based on the Crow Search Algorithm (CS...
متن کاملA Hybrid Time Series Clustering Method Based on Fuzzy C-Means Algorithm: An Agreement Based Clustering Approach
In recent years, the advancement of information gathering technologies such as GPS and GSM networks have led to huge complex datasets such as time series and trajectories. As a result it is essential to use appropriate methods to analyze the produced large raw datasets. Extracting useful information from large data sets has always been one of the most important challenges in different sciences,...
متن کاملAn Improved Apriori Algorithm for Association Rules
There are several mining algorithms of association rules. One of the most popular algorithms is Apriori that is used to extract frequent itemsets from large database and getting the association rule for discovering the knowledge. Based on this algorithm, this paper indicates the limitation of the original Apriori algorithm of wasting time for scanning the whole database searching on the frequen...
متن کاملMedical Diagnosis Data Mining Based on Improved Apriori Algorithm
With the wide application of computer science and technology, the amount of data generated by various disciplines increased rapidly. In order to discover valuable knowledge in these databases, people use data mining methods to solve this problem. The application of association rule mining is an important research topic in data mining. As the association rule technology becomes more mature, it i...
متن کاملEnterprise Human Resources Information Mining Based on Improved Apriori Algorithm
With the unceasing development of information and technology in today’s modern society, enterprises’ demand of human resources information mining is getting bigger and bigger. Based on the enterprise human resources information mining situation, this paper puts forward a kind of improved Apriori algorithm based model on the enterprise human resources information mining, this model introduced da...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Applied sciences
سال: 2023
ISSN: ['2076-3417']
DOI: https://doi.org/10.3390/app131910577